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Abstract 

We study capital process behavior in the fair-coin and biased-coin games in the 
framework of the game-theoretic probability of Shafer and Vovk (2001). We show 
that if Skeptic uses a Bayesian strategy with a beta prior, the capital process is 
lucidly expressed in terms of the past average of Reality's moves. From this it is 
proved that the Skeptic's Bayesian strategy weakly forces the strong law of large 
numbers (SLLN) with the convergence rate of 0(ydogn/n) and if Reality violates 
SLLN then the exponential growth rate of the capital process is very accurately 
described in terms of the Kullback divergence between the average of Reality's 
moves when she violates SLLN and the average when she observes SLLN. We also 
investigate optimality properties associated with Bayesian strategy. 

Keywords and phrases: Azuma-Hoeffding-Bennett inequality, beta-binomial distribution, 
exchangeability, game-theoretic probability, hyper geometric distribution, Kullback diver- 
gence, prior distribution. 



1 Introduction 

Coin tossing is the most basic object in the study of probability theory. This is also true 
in the new field of game-theoretic probability and finance established by Shafer and Vovk 
(2001). In fact they start with discussing the fair-coin game in Section 3.1 of their book. 
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Recently Kumon and Takemura (2007) [6J, motivated by Takeuchi's works (|14j. [15]), 
proved that a very simple single strategy, based only on the past average of Reality's 
moves, is weakly forcing SLLN with the convergence rate of 0(Wlogn/n), which is a 
substantial improvement over the original strategy of Shafer and Vovk. Versions of SLLN 
for unbounded moves by Reality is obtained in Kumon, Takemura and Takeuchi (2007) [7] . 

In this paper for general biased-coin games, we consider a class of Bayesian strategies 
for Skeptic. As in Kumon and Takemura (2007) we prove that Bayesian strategies in 
the class weakly force SLLN with the convergence rate of 0(y log n/n). Furthermore we 
establish the important fact that if Skeptic uses a Bayesian strategy and Reality violates 
SLLN, then the exponential growth rate of the Skeptic's capital process is very accurately 
described in terms of the Kullback divergence between the average of Reality's moves 
when she violates SLLN and the average when she observes SLLN. 

In the protocol of the coin-tossing game of Shafer and Vovk (2001), there is no prob- 
abilistic assumption on the behavior of Reality. In the games, Skeptic tries to become 
rich and Reality tries to prevent it. However in the Bayesian strategy, Skeptic simply and 
naively assumes that Reality behaves probabilistically and for choosing his moves Skeptic 
uses the Bayesian prediction of Reality's moves. It is a remarkable fact that this naive 
Bayesian prediction by Skeptic actually works and forces SLLN even if Reality's moves 
are not probabilistic at all and Reality tries to beat Skeptic as an adversary. Furthermore 
Skeptic achieves an optimal growth rate if Reality violates SLLN in a way accounted for 
by the prior. As in the standard statistical decision theory (e.g. Berger [2] and Robert 
[8]), this optimality is inherent in Bayesian procedures. However in the setting of the 
present paper, only a very simple protocol of the game is assumed and no other modeling 
assumptions are made on Reality's moves. In this sense, we believe that the optimality 
considered in this paper has much broader conceptual implications than those offered by 
the standard statistical decision theory. 

The organization of this paper is as follows. In Section [2] we formulate coin-tossing 
games and set up some notations. In Section [3] we formulate Bayesian strategy of Skeptic 
for a given probability distribution on the space of Reality's moves. In particular we 
consider Bayes procedures with respect to beta-binomial distribution and hypergeometric 
distribution. In Section @] we show that Skeptic's capital process is expressed in a closed 
form when he uses a Bayesian strategy of Section [3) Then using Stirling's formula we prove 
that the exponential growth rate of the capital process is naturally described by means 
of the Kullback divergence. In Section [5] we consider the case that Skeptic is restricted 
to only buy the tickets in coin-tossing games. In Section [6] some numerical examples are 
presented. We end the paper with some concluding remarks in Section [71 

2 Notations on coin-tossing game 

Here we prepare some notations on coin-tossing games. We consider a general biased-coin 
game between Skeptic and Reality in the following parametrization. In the protocol, the 
head and the tail of coin are coded as 1 and 0, respectively. Furthermore the probability 
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of heads < p < 1 is given. 

Biased-Coin Game 
Protocol: 

/Co := 1. 

FOR n = 1,2,...: 

Skeptic announces M n e BL 

Reality announces x n G {— p, 1 — p}. 

/C„ = /C„_i + M n x n . 
END FOR 

Skeptic starts with the initial capital /Co = 1- For each round n, Skeptic first announces 
M n , which is the number of tickets he buys. x n stands for a ticket which sells for the price 
of p and it pays the amount of one if Reality chooses heads and nothing if she chooses 
tails, p is called the risk neutral probability (e.g. Takemura and Suzuki (2007)). Except 
for Section [5] we consider the case that Skeptic is allowed to sell the tickets (M n < 0). 
M n x n is the increment of Skeptic's capital from round n — 1 to n. The case p = 1/2 is 
the fair-coin game. Although most of our results can be presented in the fair-coin game, 
it is instructive to consider the biased-coin game for the additional asymmetry. 

Let £ n = xi . . . x n denote the sequence of Reality's moves up to time n. h n denotes the 
number of heads and t n = n — h n denotes the number of tails in £ n . Furthermore write 

Then s n = h n — pn and 

h n = s n + pn, — = p + x n , t n = n - (s n + pn), — = 1 - p - x n . (1) 
n n 

In the above biased-coin game, as discussed below, Reality is forced to observe SLLN 
x n — ► or equivalently to follow the risk neutral probability h n /n — ► p. 
In the following we write 



M n 



/C n -i 

and express the recursive relation of the capital process in the multiplicative form K n = 
/C n _i(l + v n x n ). Then starting from the initial capital /C = 1, the capital process is 
written as 

n 

/C„ = + Vixi). 
i=i 

As in Shafer and Vovk (2001), we can think that Skeptic chooses his strategy V which 
specifies v± = z^(xi, . . . , £j„i) as a function of x%, . . . , Note that Skeptic has to choose 

< v i < - (2) 



1 - p p 
if he has to avoid bankruptcy (/Q < 0). 
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In the protocol above there is no probabilistic assumption on the behavior of Reality. 
Suppose however that Skeptic models Reality's moves x\, . . . ,x n by a probability distri- 
bution Q = Q n on {— p, 1 — p} n . We denote the conditional expectation of Xi under Q 
given xx,. . .,x^x by 

= e q( x i I a*, ... , =pf - p, i = l,...,n, 

where 

p? = . . . , = Q(xi = 1 - p | aci, . . . , aci-i) 

is the conditional probability of heads. 



3 Bayesian strategy and its optimality 

In this section, we first discuss optimality inherent in a Bayesian strategy. It is not obvious 
how to formulate optimality of Skeptic's strategies. Allowing equalities in (j2J), for each 
fixed path of Reality's moves £ n = x\ . . . x n up to time n, the optimum moves of Skeptic 
are given as 

1/P if a* = 1-/0 , 3 v 

-1/(1 - p) if Xi = -p 

for i — 1, . . . , n, with the resulting capital K n = p~ hn {l — p)~ fn . However this argument 
is clearly meaningless because in each round Skeptic has to choose i/j first and Reality 
chooses Xi after seeing z^. For Skeptic's strategy V and a path £ n , we denote his log 
capital by log/C^(£ n ) and denote a weight of the path by w(£ n ) > 0. Then we consider 
the weighted average of log/C^(£ n ) and evaluate Skeptic's strategy V by 

£ ^(niog/c^n. (4) 

£ n e{-p,i-p} n 

There is no loss of generality in assuming 1 = p i- P } n w (£ n )- Therefore {^(£ n )} is 

a probability measure Q and (T4J) can be written as 

E Q [logK(0}- (5) 

For a given Q the optimum V = Vq maximizing dSJ) is given as follows by a standard 
backward induction argument of dynamic programming. We call Vq the Bayesian strategy 
for Q. 

Theorem 3.1. The optimum strategy V = Vq maximizing |3]] is given by 

( x P?~P x? E Q {xi | Xl ,...,Xi-i) . 

n{x 1 ,...,x i -i) = — r = ^ r = s , i = l,...,n. (6) 

p(l-p) p(l-p) Pl-P 
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Proof. Consider the optimum move of Skeptic at the last round n, given all previous 
moves of the players. The terms of 



i=l 



are all fixed except for the last term log(l + v n Xn). Consider maximizing the conditional 
expected value g{y) = Eq (\og(l + vx n ) \ Xi, . . . , x n _i). g(v) is a concave function of v 
and under the convention = OlogO, g{y) is maximized by solving g'iv) = as long as 
this equation has a solution in [—1/(1 — p), 1/p]- Now 



9 \ V ) = PnT—T, IT - I 1 - Pn 



Solving this we have 



^n(.-^lj • • • i -En—l; 



Pn~ P £ n 



p(i-p) p{i-py 



Note that (p® — p)/p(l — p) G [—1/(1 — p), 1/p]. Therefore the optimum move of the 
Skeptic at the last round i = n is given by ([6]). Now for the round % = n — 1 we can do the 
same argument based on the conditional distribution of x n _i given x\, . . . , x n -2- Then by 
backward induction ([6]) is proved for all % — n — 1, . . . , 1. □ 

In the beginning of this section we argued that ([3]) for a fixed sequence £ n = X\ . . . x n 
is meaningless. However Theorem 13.11 reduces to ([3]) if Q is a point mass at a particular 
£ ra . This shows that the optimality in Theorem 13.11 for a given Q is in a sense a trivial 
inherent optimality of a Bayes procedure and the important question is the choice of the 
probability measure Q. In the following we consider various choices of Q, including those 
whose support is a proper subset of {— p, 1 — p} n . 

The obvious first candidate for Q is the Bernoulli model, where Reality is modeled to 
choose her move independently for each round as Q(xi = 1 — p)=p=l — Q(x{ = —p). 
Then 

Q(n=p""(i-p)S 

where h n and t n are the numbers of heads and tails in £ n = x\ . . . x n . In this case pf = p 
for all i and Ui = (p — p) / p(l — p) is a constant. We see that the fixed e-strategy of Chapter 
3 in Shafer and Vovk (2001) corresponds to this case. 

From Bayesian viewpoint it is natural to consider a mixture of the Bernoulli model 
with respect to a beta prior, which is the natural conjugate to the binomial distribution. 
Suppose that Skeptic has a prior beta distribution tt on the success probability p of the 
Bernoulli model. 

<dp) = —^-—p a - 1 (l-pf- l dp, a,/3>0. (7) 
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We can think of a and (3 as prior numbers of heads and tails. Then 



B(a + h n ,(3 + t n ) 



B(a,f3)J v ^ " B(a,P) 

(T(a + h n )/r(a)) x (T(P + t n )/T(i3)) 

T{a + p + n)/T(a + p) 1 ' 

is the beta-binomial distribution and we call this model "beta-binomial model" and call 
the Skeptic's associated strategy "beta-binomial strategy" . In this model 

~q _ B(a + hn-x + l,/? + t w _i) a + h n _ x 

Pn ~ B(a + 1^-1,0 + ^1) ~a + /3 + n-l' U 

and it follows that 

u i= P r\ = t {l ~^ a ~ P ^ 3i -\, i = l,...,n. (10) 

Finally we consider a hypergeometric model, whose support may be a proper subset 
of {— p, 1 — p} n . This is a somewhat artificial model, but it is useful as a benchmark in 
considering optimality of the beta-binomial strategy. As we see below the hypergeometric 
model has an opposite characteristic to the beta-binomial model. Let N > n and < 
M < N be positive integers. 

Consider an urn containing M red balls and N — M black balls. Skeptic models 
Reality's behavior in such a way that Reality draws a ball from the urn one by one without 
replacement and she chooses heads if the ball is red in the z-th drawing. Therefore Skeptic 
is considering an initial part of a finite horizon game with N rounds. Under this model 

c,tt»\ - 1 O ( V) _ < M! /' M - ''")'' x <v± ~ MML - M - '"'') ,„) 
m) GIL) O m/(N- n y. • ( > 

0<h n <M, < t n < N - M, 

and 

Ag = g(xi...a; w _i(l-p)) = (M-jn-QI (N-n)\ M - hn-i ( . 
Pn Q( Xl . . . x n _0 (M-h n .. 1 -l)l(N-n + l)\ N-n + 1 1 ' 

Since the hypergeometric model holds for each i < n, it follows that 

_ M - - p(i - 1) - p(N - 2 + 1) _ M - pAT - 

(jv - i + i) P (i - p) (N-i+i) P (i- P y [ } 

Note the similarities between (jSJ) and ( TTTj) and between ([9l) and ( 1T21) . If we put 

a = -M, P = -(N-M), (14) 

then (ED coincides with (TJJ). 



In order to make the correspondence clearer, define 



(a) m = a(a + 1) • • • (a + m — 1), (a) = 1, 

for real a and non-negative integer m. For a > 0, (a) m = T(a + m)/T(a). Then with the 
correspondence ([HP , both (jSj) and ffTTl) can be written as 

Q(C) = ( f k . (is) 

We see that the hypergeometric model is the "negative" of the beta-binomial model. In 
the beta-binomial model is an increasing function of /i n _i, whereas in the hypergeo- 
metric model is a decreasing function. Another way of understanding the connection 
is the Polya's urn model (e.g. Section V.2 of Feller (1968), Takemura and Suzuki (2005)). 
Hypergeometric model corresponds to sampling without replacement, Bernoulli model cor- 
responds to sampling with replacement and beta-binomial model corresponds to Polya's 
urn model where a ball with the same color is added at each draw. 

The extreme case of the hypergeometric model is the case N = n. Then Q is con- 
centrated on £ n with the number of heads exactly equal to M. Among the exchangeable 
models, in the sense that they assign the same probability to all £ n with the same number 
of heads, this model is most "meaningless" as in the very first example of this section. 
Note that the support of the hypergeometric model is the whole sample space {— p, 1 — p} n 
if and only if n < min(M, N — M). One way of justifying the hypergeometric model is 
to add a requirement in the protocol of the game, such that Reality has to choose her 
path in the support of Q. By this requirement Reality's move space is decreased and 
the game becomes more favorable to Skeptic. This implies that the hypergeometric case, 
in particular the extreme case N — n, can serve as an upper bound to Skeptic's capital 
process. We should also mention de Finetti's theorem ([1], [13J), which states that an 
infinite sequence of exchangeable 0-1 random variables has to be a mixture of infinite in- 
dependent Bernoulli trials. We see that the hypergeometric model is naturally associated 
with a finite horizon game. 



4 Capital precess of Bayesian strategy 

In this section we investigate capital process when Skeptic uses a Bayesian strategy. We 
first give a general formula for Skeptic's capital as a ratio of the probabilities of Reality's 
path under the assumed distribution and under the risk neutral probability distribution. 
This gives us a closed form expression of the capital process for the beta-binomial model 
and the hypergeometric model of the previous section. Then for the beta-binomial model, 
we use Stirling's formula to describe the exponential growth rate of the capital process. 
It leads to the Kullback divergence and proves that the beta-binomial strategy weakly 
forces SLLN with the convergence rate of 0(\J\ogn/n). Finally we consider optimality 
of the beta-binomial strategy in relation to the hypergeometric model. 
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The following theorem shows that the capital process for a Bayesian strategy can be 
written as the ratio of probabilities under the assumed model and under the risk neutral 
probability measure. 



Theorem 4.1. Let Q = Q n be a probability measure on the set of paths of length n 
and let Vq den 
Vq is given by 



and let Vq denote the Bayesian strategy for Q. The value of the capital process C Q for 



p nn {± — py n 
where < p < 1 is the risk neutral probability. 

Proof. We prove ( |T6l) by induction on n. Recall that Vi = (pf — p)/p(l — p). Consider 
n = 1. If x\ = 1 — p, then 

JC^ = l + u 1 (l-p) = l + P l^ = P l= Q ^ = 1 - p) . 

P P P 

On the other hand if X\ = —p, then 

I -Pi Q(x 1 = -p) 



JC[ Q = l-v x p 



p 1 - p 



This proves (TT6|) for n = 1. 

Now suppose that ( TL6l) holds up to n — 1. Then 



As in the case of n = 1 it holds that 

1 ^rv^n ~ 



Q(x n = 1 - p | e : )/p if x n = 1 - p 

Q(^ = -pir^)/(i-p) if x„ = -p. 



Therefore ffTB]) holds also for n. □ 
Corollary 4.1. For i/ie beta-binomial model and the hypergeometric model 

IT'Pq ( C n \ — \ ( a )h„(P)t„ /.-x 



Remark 4.1. Formulation of Theorem \4-l\ in the present form was suggested by Vladimir 
Vovk to one of the authors in a discussion during the 16th international conference on 
algorithmic learning theory. Theorem \4-l\ actually follows from some general facts, in- 
cluding the equivalence of game-theoretic martingales and measure-theoretic martingales 
(Section 8.2 of Shafer and Vovk (2001)), expressing positive martingales with expected 
value of 1 as likelihood ratios and the non-negativeness of the Kullback divergence. 
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We now use Stirling's formula to prove that the beta-binomial strategy weakly forces 
SLLN. Let a > 0,/3 > in ffTTj) . The log capital process is written as 



log = -h n \ogp - t n log(l - p) + logI> + h n ) - lo g r(a) 

+ io g r(/5 + t n )-io g r(/3) 

- lo g r(a + /3 + n) + logr(a + 
If both h n and t ra are large, we can use Stirling's formula 

logT(a;) = (^x — logx — x + logv^r + 0(x~ 1 ). 

More precisely for all x > 

< logT(a;) — (^x — log x — x + log v^r < • 

For notational simplicity write n' = a + (3 + n, h' n = a + h n , t' n = (3 + t n . Then 

io g r(^) + bgr(0-iogr(n') 

-^log^ + ClogC-.Mog.'4log^ + log^ + o(-^ 

= ^ log ^ + 4 log % - ~ log ^ + log + O f . \ - 
For < p, q < 1, let 

p 1 — p 

g) =plog- + (l-p)log- 

q l-q 
denote the Kullback divergence between p and q. Then log /C^ is written as 



log*- = n>D(^\\ P ) - \ logM + *(„,„) +0 (— -L_ 

where 

c (a, /?) = -log5(a,/3) + a logp + /31og(l - p) + logv / 27r. 
By the Taylor expansion 

D(p + 6\\p) = (p + S) log(l + -) + (1 - p - 5) log(l - T 6 



(If 



p7 v r y & V 1-pV 2p*(l-p*)' 

where p* is some value between p and p + 5. Recall that h n /n = p + x n . Then with 5 = x n 
we have 

1 f TlX^ \ 

" -logn + o(nx£) + 0(1). (19) 



2Vp(l-p) 
Hence we obtain the following result. 
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Theorem 4.2. When Skeptic follows the beta-binomial strategy (a > 0, (3 > 0) given by 

lim sup (liminf) K% = oo 



if and only if 



lim sup (liminf) — — ^—r — \ogn = oo. 

VpU-p) 



A sufficient condition for lim sup„ JC^ = oo is 

lim sup ^1^1 > a/p(1 - pj, 
n y/iog n 

and a necessary condition for lim sup n = oo is 

\/n\x n \ 



lim sup — - > y/p(l - p). 

n Vl°g n 

Note that lim/C^ = oo if and only if liminf K.^ = oo. This theorem states that the 
Bayesian strategy ( fTUl) weakly forces that x n converges to 0. The convergence rate is 
0(y/logn/n) and the the convergence factor is a/ p(l — p). 

We also note that from the log expression (}T9]) . the capital K% behaves as 

*Z^n\ A = —^—(^-p{l-p) 
2/o(l -p) V lo g^ 

Hence we know that, if SLLN holds (x n — > 0) with a convergence rate slower than 
0(y/logn/n), then Skeptic's capital grows faster than any polynomial order of n. 

Remark 4.2. As clarified by the above argument, the Bayesian strategy which is plain 
in itself, greatly simplifies the proof and derivation of the strong law of large numbers for 
coin-tossing games. Furthermore Skeptic has a wide choice of prior distributions on the 
behavior of Reality, although the question of optimal choice of prior distributions seems 
to be a difficult problem. Capital processes for various values of a, [3 are illustrated by 
numerical examples in Section^ 

Remark 4.3. As discussed in Remark 2 of Kumon and Takemura ff^/, the convergence rate 
of 0(\J\ogn/n) in Theorem 4-2 is closely related to Azuma-Hoeff ding-Bennett inequality, 



which is a large deviation type inequality for bounded martingale differences. See Azuma 
(1967), Dembo and Zeitouni (1998), and Appendix A. 7 of Vovk, Gammerman and Shafer 
(2005). 

Remark 4.4. We also point the importance of game-theoretic results that the question of 
the optimal growth rate of Skeptic 's capital cannot even be asked in the standard measure- 
theoretic probability theory. This question can be asked in the algorithmic theory of ran- 
domness, see e.g. Schnorr (1970) (1971), Vovk (1987). But game-theoretic probability 
theory does not suffer from the two serious disadvantages of the algorithmic theory of ran- 
domness: the arbitrary constants and the heavy restrictions on the allowed sample spaces 
caused by considerations of computability. 
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For the rest of this section we consider performance of the beta-binomial strategy 
compared to the hypergeometric model. As remarked at the end of Section [3], the extreme 
case N = n of the hypergeometric model can serve as an upper bound for the beta- 
binomial models. We first state this in the following proposition. 

Proposition 4.1. For every exchangeable probability Q on the set of paths of length 
n, the following inequality holds for each £ n . 

' >*Z Q (C). (20) 



p hn (l — p) tn n\ 

Proof. The left-hand side is the value of the capital process for the hypergeometric prior 
with n = N. The prior is the uniform distribution over the set of paths {£ n } with the same 
number h n of heads. For exchangeable Q, the right-hand side is constant for each path 
£ n with the same number h n of heads. Then the inequality follows from the optimality of 
the left-hand side for the hypergeometric prior with n = N . □ 

Write 

1 h H ' 



n-"rf 



p hn (l — p) tn n\ 

Conceptually we need to consider /C* for each n and the number of heads h n , separately. 
Let denote the capital process for the beta-binomial model. Then /C*(£ n )//C°'^(£) is 
written as 

K\t n \/n\ 



/C^(e) B(a + h n ,p + t n ) 

Therefore 



B(a,/3)>1. 



log^Cr) - logK^r) = log/in! + logt„! - logn! 

- ( log V(a + K) + log r(/3 + t n ) - log V(a + p + n)) 
+ \ogB(a,P). 

Stirling's formula for log x\ is written as 

< log x\ - ( x + ^) lo g( x + 1) — (a; + 1) + log \Phx 



< 



12(x + l) 



Hence we have 



log £*(£") - log/C^(D = (h n + I) \ogh n -(h n + a-~) \og(h n + a) 

+ (tn + -) logt n - (t n + /3 - -) \og(t n + (3) 

- (n + log n + (n + a + (3 - log(n + a + 0) 
+ ]og B(a,p) + o(—l rT 

\ 111111 V i I'm 
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The right-hand side is further simplified as 



^1 — a) logh n — a + (1 — (3) logi„ — /3 — (1 — a — 0) \ogn + (a + (3) 

1 



+ log B(a,P) + 



mm(h' n ,t' n 



hgn + (1 - a) log — + (1 - (3) log - + log B{a, /3) + 0\ ' 



n n ' \mm(h' n ,t' n 

We summarize the above calculation in the following proposition, which states that /C* (£ n ) 
surpasses /C"'^(£ n ) only by a polynomial factor of n. 

Proposition 4.2. 



*B(a,0)(£) ~°(^) ~ (1 + 0(1/ min(^, Q). 



Considering the fact that Skeptic can achieve XT*(£ n ) only in "hindsight" (i.e. after 
seeing the number of heads h n for each n) and the fact that /C*(£ n ) and JC^' 13 ^ 71 ) grow 
exponentially when Reality violates SLLN, we see that the beta-binomial strategy is close 
to optimum. 



5 Capital process for the one-sided case 

In this section we assume the protocol as before, but now Skeptic is required to choose 
M n > 0. We consider Skeptic's strategy V + with 

vf =max(^,0), (21) 

where is given by ffTU]) . We can also consider the negative part strategy V~ with 
v~ = min(^,0). 

In the following, we investigate the relation between the behavior of the original capital 
process K£ and the one-sided capital process JC^ + - If M n or u n changes the sign only 
finite number of times, then the behavior of /C^ + is fairly trivial. If v n is eventually all 
non- negative, then K.„ and /C^ + are asymptotically equivalent. On the other hand if v n 
is eventually negative, then /C^ + stays constant, whereas the behavior of /C^ is described 
in Theorem 14.21 Therefore we will consider the case that v n changes the sign infinitely 
often. At first, by writing 

s n = (1 - p)a - pp + s n , 

we can express 

_ Sn-l 

Vn ~ ( a + P + n-l)p(l- p)' 
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Noting that at time n when s n changes the sign, 

n \n 



we start at sufficiently large time n Q such that h no /n ~ p, and proceed to divide the 
sequence {s n } into the following two types of blocks. For uq < k + 1 < I, consider a block 
{k + 1, . . . , /}. We call it a non-negative block if 

s k < 0, s fe+ i > 0, s k+2 > 0, . . . , §i > 0, s i+ x < 0. 

Similarly we call it a negative block if 

Sk > 0, s fc+ i < 0, s fc +2 < 0, . . . , §i < 0, s i+1 > 0. 

By definition, negative and non-negative blocks are alternating. 

We first consider a particularly simple case of the fair-coin game p = 1/2 and sym- 
metric prior a = (3. In this case s n = s n = when s n changes the sign, and the lengths of 
the blocks are always even numbers. Then for each block {k + 1, ...,/} we have h k = t k , 
hi = t[ and the capital ratio JCf //C^ is expressed as 



*T = 22m (a + h k ) 2 m = t-t 1 2(a + h k +j) 
K% (2a + 2h k ) 2m f_A 2 (a + /i fc + j) + 1 



3=0 

where I — k = 2m is an even number. As for the one-sided capital ratio, lCf + //C^ + = 
K,f //C^ during a non- negative block, and K,f + //C^ + = 1 during a negative block. There- 
fore /C^ < /C^ + holds for all n in the non-negative block. 

Now we consider a general biased-coin game. When s n changes the sign, we have to 
consider overshoot of order 0(l/n). Therefore we need to carefully bound the capital 
ratio ICf //C^ of V and the capital ratio fCf + /fC^ + of V + from above and below for each 
negative or non-negative block. This is conducted based on the log capital formula given 
by (USD, that is 

log^ = n>D§\\ P ) - i.ogM + M + o(— ^ 
In the above, when n = k or I, 



K „/l\ £ 

r?/ 

and when 5 is small, as was noted in Section HJ 
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Therefore with 5 = x n = 0(l/n) we have 

n' 



oi 1 - 



Hence we get for n = k or Z 

I I . / r 

so that it follows 

-v 

which implies that there exists a constant C > depending only on a, (3, and p such that 



1, Z C , /Cf 1, z c 

-2 log ^-¥ <log ^r < -2 log fc + r 

We apply this relation for successive non-negative and negative blocks by noting the 
approximation formula 

1 1 1 [ n dx 1 , n 1 

+ •■■ + -</ — + — = log ■ 



m m + 1 n j m x m mm 

Then we obtain at an end point rii of any block, 

C log < log — ^ < -- + C log h 



2 J n n K% V 2 J n n 



When we reach an end point n\ after passing sufficiently many blocks, the one-sided 
capital ratios /Cf + //C^ + behave in the same way as JCf/JC^ during non- negative blocks 
and stay one during negative blocks, so that we can also bound log /C^ ; + //C^ Q + in the 
following manner. 

( 1 n\ i n i C ^ i ^ m n ' , C 

~o - ^ M°g < lo sW+ < Clog — + — . 

From the above two relations, it follows that at a sufficiently large end point ni, the 
one-sided log capital log/C^ + differs from the original log capital log/C^ at most O(logrty). 
This fact implies that the two log capitals log /C^ + and log /C^ behave similarly except for 
O(logn). Thus from Theorem 14.21 we obtain the following result. 

Theorem 5.1. The one-sided Bayesian strategy V + given by l[21\) weakly forces the one- 
sided version of SLLN with the convergence rate 0(\J\ogn/n) and with the convergence 
factor a/ p(l — p), that is 

limsup ^~~ n > p{\ — ~p) =>. limsup/C^ + = oo. (22) 

n a/ log U n 
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6 Some numerical examples 



In this section we illustrate capital processes of our strategies for two data sets. The first 
data set is based on the Nikkei 225 stock average index for 500 days starting January 
2000. We set x n > if the opening price of the n + 1-st day was higher than the opening 
price of the n-th day. The second data set is based on the first 500 digits in the fractional 
part of 7T - 3 = 0.141592653589793.... We set x n > if the n-th digit is in {5, ... , 9}. 

For the Nikkei data we have h^o = 221 heads and t 500 = 279 tails for the 500 days. The 
values of log/C 50 o for the hypergeometric model (HG) and the beta-binomial strategies 
for p = 1/2, 2/3, 2/5, a = = 1, 100, 500, together with the positive-part (PP) and 
negative-part (NP) strategies, are tabulated in Table 1. We see the exponential growth 
of the capital process for p = 2/3. For this example it seems to to be advantageous to 
take a = (3 = 100. 



Table 1: Log capital process at n = 500 for Nikkei 225 



a, (3 


p = l/2 


p = 2/3 


p = 2/5 


HG 


6.698416 


56.24544 


5.145427 


1.0, 1.0 


0.4818099 


50.02884 


-1.071180 


(PP) 


-1.712525 


0.0 


-1.071180 


(NP) 


2.194335 


50.02884 


0.0 


100, 100 


1.781788 


51.32881 


0.2287981 


(PP) 


-0.1556388 


0.0 


0.2287981 


(NP) 


1.937426 


51.32881 


0.0 


500, 500 


0.9195455 


50.46657 


-0.633444 


(PP) 


-0.03559586 


0.0 


-0.633444 


(NP) 


0.9551413 


50.46657 


0.0 



We plot the entire log capital processes of the beta-binomial strategy with its positive 
part and negative part for the case of p = 1/2, a — (3 — 100 in Figured! The log capital 
process of the beta-binomial strategy is plotted with a solid line, that of the positive-part 
strategy is plotted with a dashed line, and that of the negative-part strategy is plotted 
with a dotted line. We see that for this example the beta-binomial strategy is close to the 
negative-part strategy. The log capital of the positive-part strategy stays constant after 
about n = 90. 

For the digits of ir, we have h^oo = 239 heads and t^oo = 261 tails. The number of 
heads and tails are more balanced for this data set than the Nikkei 225 case above. Table 
2 gives the same information as in Table 1 for this data set. We see the same tendency 
in Table 2 as in Table 1, although a = (3 = 500 performs better than a = (3 — 100. 

As in Figure 1, the log capital processes of the beta-binomial strategy with its positive 
part and negative part for the case of p = 1/2, a = (3 = 100 are plotted in FigureEl The 
log capital of the positive-part strategy stays constant after about n = 140. 
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Figure 1: Nikkei 225 data (p = 1/2, a = /3 = 100) 



Table 2: Log capital process at n — 500 for the digits of n 



a, P 


p=l/2 


p = 2/3 


p = 2/5 


HG 


3.816784 


40.88716 


9.562166 


1.0, 1.0 


-2.399822 


34.67056 


3.345560 


(PP) 


-0.9942046 


0.0 


3.957064 


(NP) 


-1.4056175 


34.67056 


-0.6115032 


100, 100 


-0.2810085 


36.78937 


5.464374 


(PP) 


-0.1820164 


0.0 


5.464374 


(NP) 


-0.09899215 


36.78937 


0.0 


500, 500 


-0.04136915 


37.02901 


5.704013 


(PP) 


-0.04499401 


0.0 


5.704013 


(NP) 


0.003624851 


37.02901 


0.0 



7 Concluding remarks 

In this paper we have shown that for general biased-coin games, some simple Bayesian 
models provide explicit strategies of Skeptic which weakly force the strong law of large 
numbers with the convergence rate of 0{^\ogn/n) and whose capital processes can be 
analyzed in detail, leading naturally to the Kullback divergence. 

We treated coin tossing and beta-binomial model for simplicity and for the sake of 
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Figure 2: 500 digits of tt (p = 1/2, a = (3 = 100) 



explicit computations. However we expect that many of the results of this paper can be 
generalized to multidimensional cases and more general prior distributions. Theorem 14.11 
should hold for games with unique risk neutral probability. 

We have only considered exchangeable priors Q. Strategies based on exchangeable 
priors can not exploit some block patterns or Markov dependencies of Reality. For example 
in the fair-coin game, the beta-binomial prior can not exploit the following pattern of 
heads: 1, 0, 1, 0, 1,0,.. .. In order to exploit a variety of patterns in Reality's moves, we 
need to use a prior Q which contains hyperparameters corresponding to these patterns. 
One might consider mixtures of priors, corresponding to various block patterns, Markov 
dependence of various orders, etc. We can take priors covering various higher order 
patterns and can analyze optimalities with respect to such priors. Another interesting 
direction would be to extend our results to other limit theorems, such as the law of the 
iterated logarithm. These subjects will be treated in our subsequent works. 
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